Dynamic Rate Allocation in Fading Multiple Access 

Channels 

Ali ParandehGheibi, Atilla Eryilmaz, Asuman Ozdaglar, and Muriel Medard 



Abstract — We consider the problem of rate allocation in a 
fading Gaussian multiple-access channel (MAC) with fixed trans- 
mission powers. Our goal is to maximize a general concave 
utility function of transmission rates over the throughput capacity 
region. In contrast to earlier works in this context that propose 
solutions where a potentially complex optimization problem must 
be solved in every decision instant, we propose a low-complexity 
approximate rate allocation policy and analyze the effect of 
temporal channel variations on its utility performance. To the best 
of our knowledge, this is the first work that studies the tracking 
capabilities of an approximate rate allocation scheme under fading 
channel conditions. 

We build on an earlier work to present a new rate allocation 
policy for a fading MAC that implements a low-complexity 
approximate gradient projection iteration for each channel mea- 
surement, and explicitly characterize the effect of the speed of 
temporal channel variations on the tracking neighborhood of 
our policy. We further improve our results by proposing an 
alternative rate allocation policy for which tighter bounds on the 
size of the tracking neighborhood are derived. These proposed rate 
allocation policies are computationally efficient in our setting since 
they implement a single gradient projection iteration per channel 
measurement and each such iteration relies on approximate 
projections which has polynomial-complexity in the number of 
users. 

I. Introduction 

Dynamic allocation of communication resources such as 
bandwidth or transmission power is a central issue in multiple 
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access channels in view of the time-varying nature of the 
channel and interference effects. Most of the existing literature 
on resource allocation in multiple access channels focuses on 
specific communication schemes such as TDMA (time-division 
multiple access) [1] and CDMA (code-division multiple access) 
[2], [3] systems. An exception is the work by Tse et al. [4], 
who introduced the notion of throughput capacity for the fading 
channel with Channel State Information (CSI) and studied 
dynamic rate allocation policies with the goal of maximizing 
a linear utility function of rates over the throughput capacity 
region. 

An important literature relevant to our work appears in the 
context of cross-layer design, where joint scheduling-routing- 
flow control algorithms have been proposed and shown to 
achieve utility maximization for concave utility functions while 
guaranteeing network stability (e.g. [5], [6], [7], [8]). The com- 
mon idea behind these schemes is to use properly maintained 
queues to make dynamic decisions about new packet generation 
as well as rate allocation. 

Some of these works ([6], [7]) explicitly address the fading 
channel conditions, and show that their policies can achieve 
rates arbitrarily close to the optimal based on a design pa- 
rameter choice. However, the rate allocation imposed by these 
schemes requires that a large optimization problem requiring 
global information be solved over a complex feasible set in 
every time slot. Clearly, this may not always be possible due 
to the limitations of the available information, or the processing 
power, or the complexity intrinsic to the feasible set. In fact, 
even in the absence of fading, the interference constraints 
between nearby nodes' transmissions may make the feasible set 
so complex that the optimal rate allocation problem becomes 
NP-hard (see [9]). 

In the absence of fading, several works have proposed 
and analyzed approximate randomized and/or distributed rate 
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allocation algorithms for various interference models ([10], [5], 
[11], [9], [12], [13]), and their effect on the utility maximization 
is investigated in [9], [14]. However, no similar work exists 
for fading channel conditions, where the changes in the fading 
conditions coupled with the inability to solve the optimization 
problem instantaneously make the solution much more chal- 
lenging. In fact, it is not even clear what algorithm can be used 
to achieve close to close-to-optimal performance. 

In this work, we propose an approximate gradient projection 
method and study its tracking capabilities when the channel 
conditions vary over time. In our algorithm, the solution is 
updated in every time slot in a direction to increase the 
utility function at that time slot. But, since the channel may 
vary between time-slots, the extend of these temporal channel 
variations become critical to the performance. We explicitly 
quantify the impact of the speed of fading on the performance 
of the policy, both for the worst-case and the average behavior. 
Our results also capture the effect of the degree of concavity 
of the utility functions on the average performance. 

Other than the papers cited above, our work is also related to 
the work of Vishwanath et al. [15] which builds on [4] and takes 
a similar approach to the rate and power allocation problem for 
linear utility functions. Other works address different criteria 
for resource allocation including minimizing the weighted sum 
of transmission powers [16], and considering Quality of Service 
(QoS) constraints [17]. 

The remainder of this paper is organized as follows: In 
Section II, we introduce the model and describe the capacity 
region of a multiple-access channel. In Section III, we consider 
the utility maximization problem in fading channel, and present 
a rate allocation policy and characterize the tracking neighbor- 
hood in terms of the maximum speed of fading. In Section IV, 
we provide an alternative rate allocation policy and provide a 
bound on the size of tracking neighborhood as a function of 
the average speed of fading. Finally, we give our concluding 
remarks in Section V. 

Regarding the notation, we denote by Xi the i-th component 
of a vector x. We denote the nonnegative orthant by M™ , i.e., 
M™ = {x e R™ I x > 0}. We write x' to denote the transpose 
of a vector x. The exact projection operation on a convex set 
is denoted by V. 



II. System Model 

We consider M users sharing the same media to communi- 
cate to a single receiver. We model the channel as a Gaussian 
multiple access channel with flat fading effects 



M 



i=l 



(1) 



where Xi(n) are the transmitted waveform with average power 
Pi, Hi(n) is the channel state corresponding to the i-th user at 
time slot n, and Z(n) is white Gaussian noise with variance 
Nq. The channel state process is assumed to be ergodic and 
bounded. We also assume that the channel states are known to 
all users and the receiver Q. Throughout this work we assume 
that the transmission powers are fixed and no prior knowledge 
of channel statistics is available. 

We model the speed of fading as follows: 

|fl"i(n + 1) - Hi(n)\ = V* for all n, i = l,..., M, (2) 

where V£ is a nonnegative random variable bounded from above 
by v\ and {V^} are independent identically distributed (i.i.d.) 
for fixed i. Under slow fading conditions, the distribution of 
V£ is concentrated around zero. 

We first consider the non-fading case where the channel state, 
H, is fixed. The capacity region of the Gaussian multiple- 
access channel with no power control is described as follows 
[18]: 



I.R G Mf : & < C (J2 HiPi > N °) ' 
for all S C M = {!,..., M}\, (3) 



C g (P,H) = 



where Pj and Ri are the 2-th transmitter's power and rate, re- 
spectively. C(P, N) denotes Shannon's formula for the capacity 
of additive white Gaussian noise (AWGN) channel given by 



C(P, N) = ilog(l + ^) nats. 



(4) 



For a multiple-access channel with fading, but fixed trans- 
mission powers Pi, the throughput capacity region is obtained 
by averaging the instantaneous capacity regions with respect to 
the fading process [19]: 

C a (P) = !r G : ^ ^ E « C (Y1 R i P i> N o] 

for all SC {1,...,M}|, (5) 

'This assumption is satisfied in practice when the receiver measures the 
channels and feeds back the channel information to the users. 
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where H is a random vector with the stationary distribution 
of the fading process. Let us define the notion of boundary or 
dominant face for any of the capacity regions defined above. 

Definition 1: The dominant face or boundary of a capacity 
region, denoted by J-{-), is defined as the set of all M-tuples 
in the capacity region such that no component can be increased 
without decreasing others while remaining in the capacity 
region. 

III. Resource Allocation for a Fading Channel 

The goal of the dynamic resource allocation problem is 
to find a rate allocation policy, 1Z, which is a map from 
the fading state h to the transmission rates, lZ(h) = 
(TZi(h), . . . ,lZM(h)). In the following we define the optimal 
rate allocation policy with respect to utility function, u(-). 

Definition 2: [Optimal Policy] The optimal rate allocation 
policy denoted by 1Z*(-) is a mapping that satisfies 1Z*(H) G 
C g (V*{H),H) for all H, such that 

E H [R,*(H)] = R* G argmax u(R) 

subject to ReC cl (P) (6) 
The utility function u(R) is assumed to satisfy the following 
conditions. 

Assumption 1: The following conditions hold: 

(a) The utility function u(R) is concave with respect to vector 
R. 

(b) u(R) is monotonically non-decreasing with respect to Ri, 
fori = l,...,M. 

(c) There exists a scalar B such that 

WdW < B, for all g G du{R) and all R, 

where du(R) denotes the subdifferential of u at R, i.e., 
the set of all subgradients %of u at R. 

(d) If R^ = argmax ReCi) (p H jii(R), then there exists a 
positive scalar A such that 

> A\\R}-R\\ 2 , for all R G C g (P, H). 

Assumption \V[c) imposes a bound on subgradients of the 
utility function. In this paper, it is sufficient to have a utility 
function with bounded subgradient only in a neighborhood of 
optimal solution, but weakening Assumption [Tf c) may require 

2 The vector g is a subgradient of a concave function / : D — * K at xq, if 
and only if f(x) — f(xo) < g'(% — ^o) for all % £ D. 



unnecessary technical details. Assumption |TJd) is a strong 
concavity type assumption which is satisfied for most of the 
utility functions. In fact, strong concavity of the utility implies 
Assumption Old), but it is not necessary. 

Definition 3: [Greedy Policy] A greedy rate allocation pol- 
icy, denoted by TZ, is given by 

H(H) = argmax u(R) 

subject to ReC g (P,H) (7) 

i.e., for each channel state, the greedy policy chooses the rate 
vector that maximizes the utility function over the correspond- 
ing capacity region. 

Note that the greedy policy is not necessarily optimal for 
general concave utility functions, i.e., the expected achieved 
rate does not maximize the utility over the throughput capacity 
region. However, the performance difference, i.e., utility differ- 
ence between the expected rates assigned by the greedy and the 
optimal policy, is bounded and the bounds can be characterized 
in terms of channel variations and the structure of the utility 
function [20]. 

The maximization problem in (0 is a convex program and 
the optimal solution can be obtained by iterative methods such 
as the gradient projection method with approximate projection 
studied in [21]. The fc-th iteration of this method is given by 

R k+i = fi( R k + a k g k^ g k e du ( R k^ (8) 

where g k is a subgradient of u at R k , and a k denotes the 
stepsize. V denotes the approximate projection operator which 
is defined in the following. 

Definition 4: Let X = {x G l^yla; < 6} where A has non- 
negative entries. Let y G ffi™ violate the constraint a[x < bi, 
for i G {ii, . . . ,ii}. The approximate projection of y on X, 
denoted by V, is given by 

V{y)=V il {...{V il _ 1 {Vi l {y)))l 

where Vi k denotes the exact projection on the hyperplane {x G 
R n \a' ik x = b ik }. 

An example of approximate projection on a two-user 
multiple-access capacity region is illustrated in Figure Q] 
Note that the result of projection is not necessarily unique. 
However it is pseudo-nonexpansive, i.e., the distance between 
any feasible point and the projected point is smaller than its 
distance to the original point. Under Assumption Q] and specific 
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X 



P(M) 



Fig. 1 . Approximate projection of Ft on a two-user MAC region 

stepsize rules, we established the convergence of the iterations 
in ^ to the optimal solution in (0 by using the pseudo- 
nonexpansiveness of the approximate projection (see [21], 
Proposition 2). We also showed by exploiting the polymatroid 
structure of the capacity region, each iteration in (0 can be 
computed in 0(M 3 log M) time. However, for each channel 
state, finding even a "near-optimal" solution of the problem in 
(Q requires a large number of iterations, making the online 
evaluation of the greedy policy impractical. In the following 
section, we introduce an alternative rate allocation policy, which 
implements a single gradient projection iteration of the form ^ 
per time slot. 

IV. Approximate Rate Allocation Policy 

In this section, we assume that the channel state information 
is available instantly at each time slot n, and the computational 
resources are limited such that a single iteration of the gradient 
projection method in ([8]) can be implemented in each time slot. 

Definition 5: [Approximate Policy] Given some fixed integer 
k > 1, we define the approximate rate allocation policy, 1Z, as 
follows: 

K(H(0))=K(H(0)), 



where 



and R{ £ 
iterations 



K{H(n)) = RJ, for all n > 1, 



argmax u(Rl), t = 
oti<A-i 



(9) 



(10) 



pM 



is given by the following gradient projection 



R 



i+i _ 



Pi 
Pi 



Tl(H(kt)) 



op g° 



i = i,...,fc-i, (ii) 



where gi is a subgradient of u(-) at RP , a 3 denotes the stepsize 
and P t is the approximate projection on C g (P, H(ktj). 



For k = 1, ( flTT l reduces to taking only one gradient 
projection iteration at each time slot. For k > 1, the proposed 
rate allocation policy essentially let the channel state change for 
a block of k consecutive time slots, and then takes k iterations 
of the gradient projection method with approximate projection. 
Note that to compute the policy at time slot n, we are using the 
channel state information at time slots kt, k(t—l), . . . Hence, in 
practice the channel measurements need to be computed every 
k time slots. 

There is a tradeoff in choosing k, because taking only one 
gradient projection step may not be sufficient to get close 
enough to the greedy policy's operating point. Moreover, for 
large k the new operating point of the greedy policy can be 
far from the previous one, and k iterations may be insufficient 
again. So the parameter k should be chosen optimally to obtain 
the best performance for the approximate policy. 

Before stating the main result, let us introduce some defini- 
tions and lemmas. 

Definition 6: Let Q be a polyhedron described by a set of 
linear constraints, i.e., 



Q = {x 6 R n : Ax < b} . 



(12) 



Define the expansion of Q by (5, denoted by £s(Q), as the 
polyhedron obtained by relaxing all the constraints in <fT~2t . i.e., 
£ S (Q) = {x G M™ : Ax < b + 61} , where 1 is the vector of 
all ones. 

Definition 7: Let X and Y be two polyhedra described by a 
set of linear constraints. Let £d(X) be an expansion of X by 
relaxing its constraints by d. The distance rfjj(X, Y) between 
X and Y is defined as the minimum scalar d such that X C 
£ d (Y) and Y C £ d (X). 

The next lemma shows that if the distance between two 
capacity regions is small, the distance between the optimal 
solutions of maximizing the utility function over these regions 
is also small. 

Lemma 1: Let Hi and H2 be two different channel states. 
Also, let R* and i?2 t> e tne optimal solution of maximizing a 
utility function over C g (P, Hi) and C g (P, if 2), respectively. 
If the utility satisfies Assumption [T] and 



d H (C g (P,Hi),C g (P,H 2 )) < S 



Then, we have 



\rI-r;\\<5? 



(!) 



(13) 
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Proof: See Appendix U ■ 
In the following lemma, we translate the model for the speed 

of fading in terms of channel state variations into changes in 

the corresponding capacity regions. 

Lemma 2: Let {[-Hi(n)]i=i Af} be the fading process that 

satisfies condition in (0. We have 

d H (c g (P,H{n + l)),C g (P,H{n))) <W n , (14) 

where {W„} are nonnegative independent identically dis- 
tributed random variables bounded from above by w = 
i 2~2iLi v l Pi, where v l is an upperbound on the process {V^} 
and Pi is the i-th user's transmission power. 
Proof: By Definition [7] we have 



d H C g (P, H(n + l)),C g (P, H(n)) 



max — 

S<ZM 2 



log (l + 



< max 



E ieS |ffi(n+l)-ffi(rO|Pi 



scm 2(1 + 2~2ies H i( n ) P i) 

M M 



(15) 



Therefore, d) is true for W n = \ £™i V n P i- since V n s are 
i.i.d. and bounded above by v l n , W n 's are i.i.d. and bounded 
from above by ^ 2~2iLi v % Pi- 



The following lemma by Nedic and Bertsekas [22] addresses 
the convergence rate of the gradient projection method with 
constant stepsize. 

Lemma 3: Let rate allocation policies TZ and TZ be given by 
Definition[3]and Definition^ respectively. Also, let Assumption 
Q]hold and the stepsize a n be fixed to some positive constant 
a. Then for a positive scalar e we have 

u(n(H(n))) > u(n(H(kt))) - (16) 



where k satisfies 



k > 



\R° t -H(H(kt))\f 



(17) 



Proof: See Proposition 2.3 of [22]. 
We next state our main result, which shows that the approx- 
imate rate allocation policy given by Definition [5] tracks the 
greedy policy within a neighborhood which is quantified as a 
function of the maximum speed of fading, the parameters of 
the utility function, and the transmission powers. 



Theorem 1: Let Assumption [TJ hold and the rate allocation 
policies TZ and 1Z be given by Definition [3] and Definition |3J 
respectively. Let fc = L(^§')^J an d fi x the stepsize to a = 
(■^§4-) 3 w'3 in Eq. ( fTTT ), where w' = w^(w^ + w is 

the upperbound on W n as defined in Lemma |2j A and B are 
system parameters depending on the structure of utility function 
as in Assumption [T2c),(d). Then, we have 

\\n{H{n)) -H(H{n))\\ < 2d = 2(^)i«/i (18) 
Proof: First, we show that 

\\n(H(n))-1Z(H(kt))\\<9 = {^w^, (19) 

where t = L i tt"J • The proof is by induction on t. For t = the 
claim is trivially true. Now suppose that ( fT~9b is true for some 
positive t. Hence, it also holds for n — k(t + 1) by induction 
hypothesis, i.e., 



\\B° +1 -TZ{H(kt))\\<e. 
On the other hand, by Lemma [TJ and Lemma [2] we have 

\\H(H(k[t + 1))) - K(H{kt)) || < kw' < 9. 
Therefore, by triangle inequality we have the following 



t+i 



K(H(k(t + 1))) || < 26>. 



(20) 



(21) 



(22) 



After plugging the corresponding values of a and 6, it is 
straightforward to show that ( fT7b holds for e = aB 2 . Thus, 
we can apply Lemma [3] to show 



(ll(H(n))) - u[n(H(k{t + 1)) 
By Assumption [TJd) we can write 



< aB 1 



\1Z{H{n))-K{H{k{t+l)))\ 



< 



,aB' 2 



(23) 



(24) 



Therefore, the proof of ( fT9T > is complete by induction. 
Again by applying Lemma [TJ and Lemma [2] we have 

\\H(H (n)) - K(H(kt)) || < kw' < 9, (25) 

and the desired result directly follows from (TT9b and $25[ by 
triangle inequality. ■ 
It is straightforward to show that the parameters k and a 
in Theorem [TJ are designed such that the smallest tracking 
neighborhood, 9, is obtained for the approximate policy pre- 
sented in Definition [5] with constant stepsize. The proof is by 
parameterizing 9, the size of the tracking neighborhood, in 
terms of k and minimizing 9(k) by relaxing k to be a real and 
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differentiating with respect to k. We eliminate the full proof for 
brevity. Theorem Q] provides a bound on the size of the tracking 
neighborhood as a function of the maximum speed of fading, 
denoted by w, which may be too conservative. It is of interest 
to provide a rate allocation policy and a bound on the size of 
its tracking neighborhood as a function of the average speed of 
fading. The next section addresses this issue. 

V. Improved Approximate Rate Allocation Policy 

In this section, we design an efficient rate allocation policy 
that tracks the greedy policy within a neighborhood char- 
acterized by the average speed of fading which is typically 
much smaller than the maximum speed of fading. We consider 
policies which can implement one gradient projection iteration 
per time slot. 

Unlike the approximate policy given by (O which uses the 
channel state information once in every k time slots, we present 
an algorithm which uses the channel state information in all 
time slots. Roughly speaking, this method takes fixed a number 
of gradient projection iterations only after the change in the 
channel state has reached a certain threshold. 

Definition 8: [Improved Approximate Policy] Let {Wn} be 
the sequence of nonnegative random variables as defined in 
Lemma [2] and 7 be a positive constant. Define the sequence 
{Ti} as 



To 

Ti+i 



= 0, 

= mm{* I > ] W n > 7}. 



t-i 



-Ti 



(26) 



Define the improved approximate rate allocation policy, TZ, 
with parameters 7 and k, as the following: 



where 



K{H(0))=K{H(0)), 
Tl{H{n)) = R T tl for all n > 1, 



t = max{i I Ti < n}, 

t = argmax u(Rj), 

Gkj<k-i 



(27) 

(28) 
(29) 



is given by the following gradient projection 



iterations 



R 



= P t 
= Pt 



K(H(T t )) 



a 3 g 3 



3=1, 



,k-l, (30) 



where g 3 is a subgradient of u(-) at R 3 , a 3 denotes the stepsize 
and P t is the approximate projection on C g (P, H(T t )). 

Theorem 2: Let t be as defined in d28l l. and let w denote the 
expected value of W n . If k = then we have 

Ti 

lim — = 1, with probability 1. (31) 

n^oo tk 

Proof: The sequence {Ti} is obtained as the random walk 
generated by W n 's cross the threshold level 7. Since W n 's 
are positive random variables, we can think of the threshold 
crossing as a renewal process, denoted by N(-), with inter 
arrivals W n . 

We can rewrite the limit as follows 



lim 

n — >oo 



n - N(tj) + N(tj) 
tk 



lim ■ 

n— >oc 



tk 



w- 



(32) 

Since the random walk will hit the threshold with probability 1, 
the first term goes to zero with probability 1 . Also, by Strong 
law for renewal processes the second terms goes to 1 with 
probability 1 (see [23], p.60). ■ 
Theorem [2] essentially guarantees that the number of gradient 
projection iterations is the same as the number of channel 
measurements in the long run with probability 1. 

Theorem 3: Let Assumption Q] hold and the rate allocation 
policies 71 and 1Z be given by Definition [3] and Definition [8j 
respectively. Also, let 7 = c(^)iwi, and k = [^-\ and fix 
the stepsize to a — in (l30l . where c > 1 is a constant 
satisfying the following equation 

(c 2 - l) 8 . 



2 8 c 4 



(33) 



Then 



\n(H(n))-H{H(n))\\<2 1+ (^. 



(34) 

Proof: We follow the line of proof of Theorem Q] First, 
by induction on t we show that 

\\K(H(n))-K(H(T t ))\\< 1 , (35) 

where t is defined in d28| ). The base is trivial. Similar to d20i i, 
by induction hypothesis we have 



\R> 



n(H(T t ))\\< r 



H+l 

By definition of Ti in d26l i we can write 

d H (C g (P, H(T t+1 )) , C g (P, H(T t )) 
Thus, by Lemma [T] we have 



< 



\\U(H(T t+1 )) - K(H(T t )) || < 7= (7 5 + (j) 5 )- 



(36) 



(37) 



(38) 
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Therefore, by combining (|36l l and <j38j by triangle inequality 
we obtain 

\\R° t+1 H(H(T t+1 )) || < 2 7 + (^)i (39) 

Using the fact that w < w — v 2i J , after a few steps of 
straightforward manipulations we can show that 



\\R t+1 -TZ(H(T t+1 ))\\ 2 < ( 27+ (^) 2 <c 4 ^. (40) 

Now by plugging in ( fTTT i the values of a and 7 in terms of 
system parameters we can verify that 



k = 



> 



\R° t+1 -K(H(T t+1 ))W 



or 



(41) 



Hence, we can apply Lemma [3] for e = A7 2 , and conclude 



i(n(H(n))) -u(n(H{T t+1 )) 
By exploiting Assumption [TJd) we have 



\n(H(n))-TZ(H(T t+1 ))\\<( 



aB 2 



< aB 1 



7- 



(42) 



(43) 



Therefore, the proof of ( f35l > is complete by induction. Similarly 
to d38T l we have 

\\n{H{n)) -n{H{T t ))\\ < 7 *( 7 * + (|)*), (44) 

and ( f34-b follows immediately from d35l l and (l44l) by invoking 
triangle inequality. ■ 
Theorem [2] and Theorem [3] guarantee that the presented 
rate allocation policy tracks the greedy policy within a small 
neighborhood while with probability 1, only one gradient pro- 
jection iteration is computed per time slot. The neighborhood 
is characterized in terms of the average behavior of channel 
variations and vanishes as the fading speed decreases. 

VI. Conclusion 

We study the problem of rate allocation in a fading multiple 
access channel with no power control from an information the- 
oretic point of view. Our goal is to approximate the optimal rate 
allocation policy, which yields an average rate that maximizes 
a general concave utility function of transmission rates over the 
throughput capacity region of the multiple-access channel. 

We present a dynamic rate allocation policy which takes 
a block of channel measurements and implements the same 
number of gradient projection iterations with approximate pro- 
jection at the end of each block. This rate allocation policy 



tracks the greedy policy within a small neighborhood whose 
size decreases as a function of the maximum speed of fading. 

In order to provide a bound on the tracking neighborhood in 
terms of average speed of fading, we present an alternative rate 
allocation policy. This policy adaptively selects variable block 
lengths for channel measurements using feedback information 
about the current channel states. It implements a fixed number 
of gradient iterations at the end of these blocks. We show that 
the ratio of the total number of channel measurements and the 
number of gradient iterations converges to 1 with probability 
one. We also provide a bound on the size of the neighborhood 
with which the new policy tracks the greedy policy as a function 
of the average speed of fading. The proposed dynamic rate allo- 
cation policies are efficiently implementable since they require 
a single gradient projection step per channel measurement and 
the projection can be done in time polynomial in the number 
of users. 
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Appendix I 
Proof of LemmaQ] 

Without loss of generality assume that u(R 2 ) > u(R[). 
To simplify the notations for capacity regions, let C\ = 
C g (P, Hi) be a polymatroid, i.e., 

Ci = I Re r*' :J2 R >- f( s 1> for a11 s M }> (45) 

for some submodular function f(S), and C2 be an expansion of 
Ci by 5. We first show that for every R G TiC^), there exists 
a vector R' G T{Ci) such that \\R - R'\\ < S, where 
denotes the dominant face of a capacity region as in Definition 
ffl 

Assume R is a vertex of C2. Then the polymatroid structure 
of C2 implies that R is the intersection of M constraints 
corresponding to a chain of subsets of M.. Hence, there is some 
k G M such that R k = f{{k}) + 5. Choose R' as follows 



R'i — 



Ri — (5, i = k 



Hi 



otherwise. 



(46) 



R' is obviously in a ^-neighborhood of R. Moreover, the 
constraint corresponding to the set M. is active for R' , so we 
just need to show that R' is feasible in order to prove that it 
is on the dominant face. First, let us consider the sets S that 
contain k. We have 

£^ = 5^ -*</(£). (47) 

ies i£S 

Second, consider the case that k ^ S. 

i€S i£Su{k} 

< f(SU{k}) + 5-R k 

< f(S) + f({k}) + S-R l 
- f(S). 

where the first inequality come from d47i i. and the second 
inequality is valid because of the submodularity of the function 
/(•)■ 

The previous argument establishes that the claim is true 
for each vertex Rj of the dominant face. But every other 
point R on the dominant face can be represented as a convex 
combination of the vertices, i.e., 

j i 
Using the convexity of the norm function, it is quite straight- 
forward to show that the desired R' is given by 



R ' = J2 a i R 'j-- 



where iL is obtained for each Rj in the same manner as in 
<|46]). 

So we have shown that there exists some R on the dominant 
face of Ci = C g (P, Hi) such that \\R* 2 -R\\ < 8. Thus, from 
the hypothesis and the fact that u(R%) > u(R*) > u(R), we 
have 

u{R* 2 )-u(R) = \u(R* 2 )-u(R)\ < B\\R*-R\\ < B5. (48) 

Now suppose that \\R1 — R\\ > (-^5)K hence by Assumption 
[TJd) we have 

u(R*i) - u(R) = \u(Rl) - u(R)\ > BS. (49) 



By subtracting (|48} from we obtain u(R* 2 ) < u(R\) 
which is a contradiction. Therefore, \\R* — R\\ < {jc$)*, and 
the desired result follows immediately by invoking the triangle 
inequality. 



